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t-h '■ ABSTRACT 

We analyzed the spatial distribution of 28500 photometrically selected galaxies with magnitude 23.5 < 
< 25.5 and redshift 1.4 < z < 3.5 in 21 fields with a total area of 0.81 square degrees. The 
galaxies were divided into three subsamples, with mean redshifts z — 1.7, 2.2, 2.9, according to the 
U n G1Z selection criteria of Adelberger et al. (2004) and Steidel et al. (2003). Combining the galaxies' 
measured angular clustering with redshift distributions inferred from 1600 spectroscopic redshifts, we 
' find comoving correlation lengths at the three redshifts of ro = 4.5 ± 0.6, 4.2 ± 0.5, and 4.0 ± 0.6/i -1 

Mpc, respectively, and infer a roughly constant correlation function slope of 7 = 1.6 ± 0.1. We derive 

~~ — similar numbers from the 1600 object spectroscopic sample itself with a new statistic, K, that is 

insensitive to many possible systematics. Galaxies that are bright in 7Z (A rcs t ~ 1500-2500A) cluster 
more strongly than fainter galaxies at z = 2.9 and z = 2.2 but not, apparently, at z = 1.7. Comparison 
to a numerical simulation that is consistent with recent WMAP observations suggests that galaxies in 
our samples are associated with dark matter halos of mass 10 11 ' 2 -10 11 - 8 M Q (z = 2.9), 10 11-8 -10 12 ' 2 M© 
(z — 2.2), 10 —10 3 Mq [z — 1.7), and that a small fraction of the halos contain more than one 
galaxy that satisfies our selection criteria. Adding recent observations of galaxy clustering at z ~ 
and z ~ 1 to the simulation results, we conclude that the typical object in our samples will evolve into 

• i-h , an elliptical galaxy by redshift z = and will already have an early- type spectrum by redshift z = 1 . 

We comment briefly on the implied relationship between galaxies in our survey and those selected 
5h \ with other techniques. 

Subject headings: galaxies: evolution — galaxies: formation — galaxies: high-redshift — cosmology: 
large-scale structure of the universe 



1. INTRODUCTION 

Early investigators studied the spatial distribution of 
galaxies because they hoped to learn about the structure 
of the universe on the largest scales. Their influential 
work was superseded, in the end, by its competition. 
Problems began with the demonstration that galaxies 
contained only a small fraction of the matter in the uni- 
verse. Galaxy formation remained too poorly understood 
to quell doubts about how faithfully galaxies traced un- 
derlying distribution of dark matter. Other observations 
improved — gravitational lensing, peculiar velocities, in- 
tergalactic absorption lines, and so on — and seemed eas- 
ier to relate to matter fluctuations. Computers became 
fast enough to predict the evolution of the large-scale 
matter distribution from the the initial conditions that 
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microwave-background missions were measuring with in- 
creasing precision. As it became clear that the simula- 
tions and observations agreed remarkably well, most re- 
searchers concluded that the large-scale structure of the 
universe could be understood completely as the product 
of gravitational instability amplifying small inflationary 
perturbations. Galaxies, once believed to be the primary 
constituent of the universe, came to be seen as small test 
particles swept into ever larger structures by converging 
dark matter flows. 

The spatial distribution of galaxies remains interesting 
because it can teach us about galaxy formation. Galaxy 
formation must be closely related to larger process of 
gravitational structure formation, since the formation of 
a galaxy begins with gas streaming into a massive po- 
tential well and ends with stars drifting in the cosmic 
flow. Lessons from 20 years of numerical investigations 
into structure formation should therefore carry over to 
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the analysis of galaxy clustering. One example is the 
known positive correlation between clustering strength 
and mass for virialized dark matter halos. Since galax- 
ies reside in dark matter halos, their clustering strength 
provides an indication of the mass of the halos that con- 
tain them. The resulting mass estimate depends on the 
assumptions that the microwave background and other 
observations have given us reliable estimates of cosmolog- 
ical parameters and of the initial matter power-spectrum, 
that numerical simulations can correctly trace the evo- 
lution of the matter distribution, at least for moderate 
densities, and that no process can significantly separate 
baryons from dark matter on Mpc scales — assumptions 
that are at least as plausible as those behind compet- 
ing techniques for mass estimation. Another example 
is the evolution in the clustering strength of a popula- 
tion of galaxies once it has formed. This is driven solely 
by gravity and is easy to predict from numerical sim- 
ulations. Comparing the clustering of (say) galaxies in 
the local universe and those at high redshift can there- 
fore suggest or rule out possible links between them. Few 
other methods are as useful for unifying into evolutionary 
sequences the galaxy populations we observe at different 
look-back times. An excellent review of the history of 
these techniques has been written by Giavalisco (2002; 
pp 620-624). 

This paper has two aims. The first is to present mea- 
surements of the clustering of UV-selected star-forming 
galaxies in a redshift range 1.4 < z < 3.5 that is only 
partially explored. Section describes the way we ob- 
tained our data, §|3]describes and justifies the techniques 
we used to estimate the spatial clustering in our galaxy 
samples, and § 0] presents our estimates of the galaxy 
correlation function at redshifts z ~ 1.7, z ~ 2.2, and 
z ~ 2.9. The survey analyzed here is several times larger 
than its predecessors; the surveyed area of 0.81 square 
degrees is roughly 700 times larger than the area ana- 
lyzed by Arnouts et al. (2002) and 4 times larger than 
the areas analyzed by Giavalisco & Dickinson (2001) and 
Ouchi et al. (2001). The second aim is to discuss what 
our measurements imply about the galaxies and their de- 
scendants. In § 15. II we show that the galaxies' correlation 
functions are indistinguishable from those of virialized 
dark matter halos with mass M ~ 10 12 M©. In § 15.21 wc 
show that the galaxies, dragged by gravity for billions of 
years, caught in the press of structure formation, would 
by redshift z = have a correlation function that is in- 
distinguishable from that of the elliptical galaxies that 
surround us. Our results are summarized in § [5J 

2. DATA 

2.1. Observed 

The data we analyzed were drawn from our ongoing 
surveys of high-redshift star-forming galaxies. A brief 
description of the surveys follows; see Steidel et al. (2003, 
2004) for further details. Deep, multi-hour U n GlZ images 
of 21 fields scattered around the sky were obtained with 
various 4m-class telescopes (table ^| . Tens of thousands 
of objects were visible in these images. For the analysis 
of this paper we ignored all but the subset (~ 20%) with 
AB magnitude 

23.5 <1Z< 25.5 (1) 
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Fig. 1. — Redshift distributions of objects w hose colors satisfy 
the three sets of selection criteria presented in § 12.11 



and AB colors satisfying the "LBG" selection criteria of 
Steidel et al. (2003), 

U n -G>G-Tl + 1.0 
G-n<1.2, (2) 

the "BX" selection criteria of Adelberger et al. (2004), 

G-1Z> -0.2 
U n -G>G~1Z + 0.2 

G -K<0.2{U n -G)+0A 
U n -G<G -K + 1.0, (3) 

or the "BM" selection criteria of Adelberger et al. 
(2004), 

G-1Z> -0.2 
U n — G>G — 1Z — 0A 

G-lZ<0.2{U n -G) + 0A 
U n -G<G-1Z + 0.2. (4) 

In this range of TZ magnitudes, the colors are charac- 
teristic of galaxies at 1.4 ;$ z ;$ 3.3. (The restriction to 
TZ > 23.5 helps eliminate most interlopers; see Adel- 
berger et al. 2004 and Steidel et al. 2004.) By obtaining 
spectra of more than 1600 galaxies with these colors, Stei- 
del et al. (2003, 2004) established that the mean redshift 
and ±1(7 range of galaxies in the three samples is 

(2.94 ±0.30 LBG 
z±a z = I 2.24 ±0.37 BX (5) 
I 1.69 ±0.36 BM. 

(These values exclude galaxies with TZ < 23.5 or TZ > 
25.5 as well as the handful of low-redshift "interloper" 
galaxies with z < 0.8.) Redshift histograms are shown in 
figure n We will use the term "photometric candidates" 
to describe the objects with 23.5 < TZ < 25.5 whose 
colors satisfy one set of the selection criteria presented 
above, and the term "spectroscopic sample" to describe 
the subset of photometric candidates that had a spec- 
troscopic redshift measured by Steidel et al. (2003) or 
Steidel et al. (2004). Although the spectroscopic sample 
is sizeable, it contains only a small fraction ( ,$ 10%) of 
the photometric candidates (see figure |21 and table 

All redshifts were measured with the Low-Resolution 
Imaging Spectrograph (LRIS; Oke et al. 1995) on the 
Keck telescopes. The number of redshifts in each field 
was determined by the number of clear nights that were 
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Table 1. Observed fields 



Name 


Af2 a 


Jvlbg 


Jvbx c 




2lbg s 




Ibm c 


3c324 


6.6 x 6.6 


11/49 


0/166 


0/126 


0.0035 


0.0034 


0.0042 


B20902 


6.3 x 6.5 


31/65 


1/207 


0/189 


0.0036 


0.0035 


0.0043 


CDFa 


8.7 x 8.9 


34/99 


0/336 


0/280 


0.0029 


0.0028 


0.0036 


CDFb 


9.0 x 9.1 


20/120 


0/316 


0/273 


0.0028 


0.0028 


0.0035 


DSF2237a 


9.0 x 9.1 


39/100 


1/367 


0/328 


0.0028 


0.0028 


0.0035 


DSF2237b 


8.9 x 9.1 


44/161 


1/516 


0/309 


0.0028 


0.0028 


0.0035 


HDF 


10.4 x 14.4 


54/187 


128/735 


37/587 


0.0022 


0.0022 


0.0028 


Q0201 


8.6 x 8.7 


18/90 


4/339 


0/285 


0.0029 


0.0029 


0.0036 


Q0256 


8.5 x 8.4 


45/126 


1/346 


0/243 


0.0030 


0.0029 


0.0037 


Q0302 


15.6 x 15.7 


46/824 


0/1778 


0/749 


0.0018 


0.0018 


0.0023 


Q0933 


8.9 x 9.2 


63/192 


0/435 


0/273 


0.0028 


0.0028 


0.0035 


Q1307 


16.3 x 16.0 


16/483 


47/1352 


9/936 


0.0017 


0.0017 


0.0023 


Q1422 


7.3 x 15.6 


96/253 


1/728 


0/491 


0.0024 


0.0024 


0.0030 


Q1623 


12.0 X 22.3 


6/462 


189/1220 


2/847 


0.0016 


0.0016 


0.0022 


Q1700 


15.4 x 15.4 


15/406 


62/1456 


1/948 


0.0018 


0.0018 


0.0024 


Q2233 


9.2 x 9.2 


44/76 


1/267 


0/181 


0.0028 


0.0028 


0.0035 


Q2343 


22.8 x 11.5 


10/385 


148/938 


8/541 


0.0016 


0.0016 


0.0022 


Q2346 


16.5 x 17.1 


1/362 


34/1142 


1/754 


0.0016 


0.0017 


0.0022 


SSA22a 


8.6 x 8.9 


42/151 


10/360 


1/253 


0.0029 


0.0028 


0.0036 


SSA22b 


8.6 x 8.9 


29/73 


5/281 


1/308 


0.0029 


0.0028 


0.0036 


Westphal 


14.9 x 15.1 


172/270 


43/724 


20/632 


0.0018 


0.0018 


0.0024 


Total 


2907 


836/4934 


676/14009 


80/9533 


0.0021 


0.0021 


0.0029 



a Area imaged, in square arcmin 

b Number of sources whose colors satisfy equation^in the field's spectroscopic/photometric catalog 
c Number of sources whose colors satisfy equation |3J in the field's spectroscopic/photometric catalog 
d Number of sources whose colors satisfy equationlHin the field's spectroscopic/photometric catalog 

c The expectatio n v alue of the integral-constraint correction for the three samples if galaxies had bias 6 = 1. The 'Total' row shows the 
value of equation 1161 



allocated. Photometric candidates were selected for spec- 
troscopy more-or-less at random, but in one way the 
selection was far from random: spectroscopic objects 
in each field were constrained to fit together in a non- 
interfering way on one of a small number of multislit 
masks. This introduced artificial angular clustering to 
the spectroscopic samples, particularly in fields where 
our image's size significantly exceeded the spectrograph's 
~ 8' field-of-view. In some cases the artificial angular 
clustering was increased by our desire to obtain partic- 
ularly dense spectroscopic sampling in some parts of an 
image, e.g., near a background QSO. 

Table □ lists the number of BM, BX, and LBG pho- 
tometric candidates and spectroscopic galaxies in each 
field. The field-to-field variations in the number of pho- 
tometric candidates per square arcminute were caused 
primarily by differences in exposure times, seeing, sky 
brightness, telescope plus instrumental throughput, and 
so on, from one run to the next. Recall that we used 
many different telescopes and cameras during our imag- 
ing survey. The expected variations in intrinsic surface 
density (also shown in the table, and calculated for galax- 
ies with bias b = 1 as described in § 13.11 below) are sig- 
nificantly smaller. 

We estimated the angular correlation functions from 
the lists of photometric candidates. Inferring a comoving 
correlation length ro from the measured angular cluster- 
ing required an estimate of the objects' redshift distribu- 
tion. For this we took the measured redshift distributions 
of the spectroscopic samples. Since the spectroscopic 
samples are large — several hundred for the BX and LBG 
criteria, nearly 100 for BM — random fluctuations are un- 
likely to have given redshift distributions to them that 
are significantly different from those of the parent photo- 
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Fig. 2. — The number of objects in the spectroscopic and pho- 
tometric samples as a function of TZ magnitude. 
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metric samples. We were able to measure a redshift for 
only ~ 80% of the objects we observed spectroscopically, 
however, and it is therefore possible that various system- 
atics (e.g., difficulties measuring spectroscopic redshifts 
for galaxies in certain redshift ranges) could have caused 
the spectroscopic and photometric samples to have some- 
what different redshift distributions. Repeated observa- 
tions of a subset of the initial spectroscopic failures show 
that these objects have the same redshift distribution as 
the initial successes, implying that any systematics are 
not severe. 

2.2. Simulated 

To help us interpret our observations, we refered at 
times the GIF-LCDM numerical simulation of structure 
formation in a cosmology with CIm = 0.3, f^A = 0.7, h — 
0.7, r = 0.21, and as = 0.9. This gravity-only simulation 
contained 256 3 particles with mass 1.4 x 10 10 h~ 1 M ( ? ) in 
a periodic cube of comoving side-length 141. 3/i -1 Mpc, 
used a softening length of 20h~ 1 comoving kpc, and was 
released publicly, along with its halo catalogs, by Frenk 
et al. astro-ph/0007362 Further details can be found in 
Jenkins et al. (1998) and Kauffmann et al. (1999). Since 
the GIF-LCDM cosmology is consistent with the Wilkin- 
son Microwave- Anisotropy Probe results (Spergel et al. 
2003), and since modeling the gravitational growth of 
perturbations on large (~ Mpc) scales is not numerically 
challenging, we will assume that the growth of structure 
found in this simulation closely mirrors the growth of 
structure in the actual universe. 

For our purposes the most interesting aspect of the sim- 
ulation is the spatial distribution of virialized "halos" , or 
overdensities with Sp/p ~ 200, since these deep potential 
wells are the sites where galaxies can form from cool- 
ing gas. The public halo catalogs were created, by the 
GIF team, by running a halo-finding algorithm at vari- 
ous time steps in the simulation. We will say that all the 
halos identified by the algorithm at time-step t = 156 
(say) had t = 156 as their time of identification. Cor- 
relation functions for halos at the time of identification 
were calculated directly from the public halo catalogs. In 
subsequent time-steps these halos were progressively dis- 
placed by gravity. Some grew; others were destroyed as 
they were subsumed into larger structures. Any galaxies 
within the halos would be likely to survive intact, how- 
ever, and it is interesting to trace the expected evolution 
in their correlation function over time. To do this, we as- 
sumed that the galaxies in a halo would be displaced by 
gravity by the same amount, and in the same direction, 
as the halo's most bound particle. If {pi} denotes the set 
of particles that were the most bound particle in a halo 
at ^earlier, we assumed that the correlation function at 
time ti a ter of the galaxies that lay within halos identified 
at time t eal \i el would be roughly equal to the correlation 
function of particles {pi} at time iiater- At spatial separa- 
tions that are large compared to the typical halo radius, 
the expected evolution of the galaxy correlation function 
is insensitive to the details of this procedure. These are 
the only spatial separations we will consider. Our focus 
on large spatial scales also justifies our ignoring the possi- 
bility of galaxy mergers. Although mergers can strongly 
affect the correlation function on small scales, on large 
scales the effect is more subtle. It can be understood 
as follows. At time tiaterj the galaxies identified at time 



^earlier will be found in halos with a range of masses, and 
their large-scale correlation function will be a weighted 
average of the correlation functions of the halos. Since 
the weighting depends on typical number of descendant 
galaxies in halos of each mass, and since halos with differ- 
ent masses have different correlation functions, the large- 
scale correlation function of descendant galaxies will be 
altered by mergers if the merger frequency depends on 
halo mass. In practice, however, the difference in correla- 
tion functions between the more massive and less massive 
halos that host the descendants is not enormous, and as a 
result the merger frequency would have to be an implau- 
sibly strong function of halo mass to alter the descendant 
correlation functions on large scales in a significant way. 

3. METHODS 

Two approaches will be used to estimate the clustering 
strength. The first approach, which is standard, relies 
almost exclusively on the angular positions of the galax- 
ies. The second relies almost exclusively on the galaxies' 
measured redshifts. These approaches exploit different 
aspects of our data and are subject to different system- 
atics. The level of agreement between them provides an 
important test of our conclusions' robustness. This sec- 
tion describes and justifies the two approaches. Readers 
interested primarily in our scientific results may wish to 
skip ahead to § 0] 

3.1. Angular clustering 

The observed clustering of galaxies on the plane of the 
sky is related to the galaxies' three-dimensional corre- 
lation function in a straightforward way. Let z denote 
a galaxy's redshift and denote its angular position. 
is written in bold face because two numbers (e.g., 
right ascension and declination) are required to specify 
the galaxy's angular position. If P(zi0i |z2©2) is the 
probability that a galaxy at known position (z2 , ©2) has 
a neighbor at position (z±,&i), then elementary identi- 
ties show that the probability that a galaxy at angular 
position ©2 will have a neighbor at angular position ©1 
is 



P(©i|0 2 ) = J dz 1 dz 2 P(z 2 \&2)P(z 1 ® 1 \z 2 & 2 ) 



(6) 



Observations indicate that the reduced correlation func- 
tion is well approximated by an isotropic power law, 

P(2i0i|^02)^-P^i0i)[l + (ri2/r o )- 7 ], (7) 

where ro and 7 parametrize the shape of the power law 
and r\2 is the distance between (zi,©i) and (22,02). 
This implies, in the circumstances of interest to us, that 
the reduced angular correlation function will also be a 
power law, 

P(0i|0 2 ) = P(® 1 )(l + A6 u p ) (8) 

with/3 = 7— 1. If the angular separation = |©i— 02 
between the galaxies is small, #12 <Si 1, and if the comov- 
ing correlation length ro does not change significantly 
from the front to the back of the survey, then 

r 

'9 



A- 



dzN 2 ^)/ 1 -*' 



dz'N(z') 



(9) 
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(see, e.g., Totsuji & Kihara 1969) where N(z) is the sur- 
vey selection function, B is the beta function in the 
convention of Press et al. (1992), g(z) = c/H(z) is 
the change in comoving distance with redshift, f(z) = 
(1 + z)Da(z) is the change in comoving distance with 
angle, and Da(z) is the angular diameter distance. This 
follows from the relationship dz [r 2 J(R 2 + z 2 )] 7 / 2 = 

r^B(l/2,( 7 -l)/2).^ 

Our first approach to estimating the three dimensional 
clustering strength will be to measure the parameters 
A and (3 of the reduced angular correlation function 
oj(9) = A8~P, then infer values for tq and 7 using the 
relationships above. Our estimates of co(0) in different 
angular bins will be based on the Landy-Szalay (1993) 
estimator 

_ DD(6) - 2DR{6) + RR{0) 

WisW = rW) (10) 

where DD{9) is the observed number of unique galaxy 
pairs with separation 6-A6/2 < 6 < 6+A6/2, DR(Q) is 
the number of unique pairs with separation in the same 
range between the observed galaxy catalog and a galaxy 
catalog with random angular positions, and RR(6) is 
the number of unique pairs in the random catalog with 
separations in the same range. In practice we reduce 
the noise in the random pair counts by creating random 
catalogs with many times more objects than the data 
catalogs (n ran d/ndata ~ 100), calculating DR and RR, 
then multiplying DR and RR by (njata — l)/^rand and 

^data(n d ata ~ 1)/ [™rand ("rand ~ 1)]) respectively. 

Integral constraint 

Unless fluctuations on the size of our typical field-of- 
view are negligible, the number of detected galaxies in 
any field will be somewhat higher or lower than in a fair 
sample of the universe, and the number of galaxies in 
the field's ideal random catalog would therefore be lower 
or higher than the observed number. As a result the 
values DR and RR that we calculate with our approach 
will be incorrect to some degree. In a single field this 
can make the clustering appear stronger or weaker than 
it truly is, but when many fields are averaged it tends 
to make the observed clustering appear artificially weak. 
This can be shown as follows. Assume that the observed 
mean density in a field differs from the global average by 
the unknown factor 1 + S, i.e., p ODS = p(l + 6), and let 
DR and RR be the pair counts calculated from scaling 
the random catalogs to the observed density. One might 
guess that the estimator of em lationl 101 ought to have DR 
and RR replaced by values corrected to the true mean 
density, i.e., by DR/{l + $) and RR/(l + d) 2 , and indeed 
Hamilton (1993, §3) has shown that the estimator 

_ DD - 2DR/(1 + 6) + RR/(1 + S) 2 
Widcal = RR/{\ + 5) 2 

is equal to the true angular correlation function on aver- 

(Wideal(0)> = (12) 

This equation does not help us directly, since we do not 
know 8 and cannot calculate o^eai, but it does show that 
the estimator luls(8) must be biased: 

(ulsW) = u(6) -* 2 ^- "(«0 - * 2 (13) 



where a 2 = Var(<5) and the approximation assumes the 
weak clustering (w < 1) limit. It is therefore customary 
to estimate u>(9) by adding a constant X = uj(9) —u>ls{9) 
to the calculated values lols- The constant X depends on 
the unknown values of 5 in the observed field or fields and 
cannot be calculated exactly. If a 2 <C 1, so that field- 
to-field fluctuations are in the linear regime and have a 
nearly Gaussian distribution, and if our data are drawn 
from n independent fields with measured pair counts 
DDi, DRi, and RRi, then the value of X appropriate 
to a given angular bin in our data set will have a vari- 
ance of 



Var(I) : 



1 



RR*. 



i=l 



[(4a, 2 + 2crf)DD 2 - 8a 2 DD i DR l 



+4a 2 DRf 



RRtot 



£2^D 2 

i=l 



around its expectation value 



(I) 



1 



V a?DDi 



(14) 
(15) 



(16) 



i=l 



Here RRtot = Y17=i ' s the sum over all fields of 
the random pair counts in the chosen angular bin. In 
practice (I) and Var(X) depend very weakly on which 
bin is chosen. Below we will take (X) at 100" as our 
best guess at the correction X. When it matters we will 
discuss the effect of the uncertainty in X. 

We use two approaches to estimate the size of the un- 
certainty <Ji in the mean galaxy density of the ith field. 
Since 

(17) 



a 2 ee -L / dfMn 2W (# 12 ), 



it can be estimated numerically as 



(18) 



if w(9i) is known (Infante 1994; Roche & Eales 1999). 
Unfortunately the iterative approach suggested by equa- 
tions ^] and [^] can be unstable, at least when the cor- 
relation function slope is allowed to vary: a large value 
of u>ls w iU imply a large correction a 2 , which implies 
an even larger lols and even larger correction, and so 
on. The instability is undoubtedly worse for large im- 
ages, where the estimate of the integral constraint cor- 
rection for one iteration is completely dominated by the 
assumed correction from the previous. We were unable 
to use equation 1181 as anything other than a consistency 
check. 

A more robust estimate of a 2 follows from theoretical 
considerations. Since matter fluctuations will still be in 
the linear regime on the large scales of our observations, 
the relative variance of mass from one surveyed volume 
to the next can be estimated from the linear cold-dark 
matter power-spectrum Ph{k) (Bardeen et al. 1986; we 
adopt the parameters T = 0.2, erg = 0.9, n = 1) with 
Parseval's relationship 



2 

°CDM 



(27T) 



d 3 kP L (\k\)\W*(k)\ 



(19) 
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where Wk is the Fourier transform of a survey volume. 
The shape of the observed volume in any one of our fields 
is reasonably approximated in the radial direction by a 
Gaussian with comoving width (rms) l z and in the trans- 
verse directions by a rectangle with comoving dimensions 



I, 



x ly. In this case 



W k = exp 



k z l z 



sm(k x l x /2) sm(k y l y /2) 
fox^x/*^ ^y^yf^ 



(20) 



The implied value of crQ DM for each sample in each field is 
shown in tabled the values assume the powerspectrum 
normalization required for the r.m.s. fluctuation as a 
function of redshift in spheres of comoving radius 8ft, -1 
Mpc to obey ag(z) = ct 8 (0)-Dl(z) where ers(0) = 0.9 
and Dl(z) is the linear growth factor to redshift z. The 
desired corrections a 2 are then given by 

a 2 ~ b 2 a 2 CDM (21) 

where &, the galaxy bias, is calculated from the ratio 
of galaxy to matter fluctuations in spheres of comoving 
radius 8ft -1 Mpc: 



(78 (z)' 



Here the galaxy variance 



72{r /8h- 1 Mpc) 7 
(3- 7 )(4- 7 )(6-7)2^ 



(22) 



(23) 



(Peebles 1980 eq. 59.3) can be derived from the fit to the 
galaxy correlation function. This approach also requires 
an iterative solution, since the correction a 2 to to(6) de- 
pends on uj(8), but the advantage is that the assumed 
size of large-scale fluctuations is anchored in reality by 
our requirement that the slope of the correlation function 
match other observations on very large scales. 

Uncertainties in the selection functions 

The discussion so far assumes that we will know the 
precise shape of the selection function N(z). In fact this 
is not true, and uncertainty in the true shape of our se- 
lection function is a source of error in the derived values 
of ro. 3 A larger width for the selection function means 
that projection effects are stronger, and therefore implies 
a larger value of ro for given angular clustering (see equa- 
tionEJ). If the selection function is a Gaussian with mean 
[i and standard deviation cr se i, and if the weak redshift 
variations of / and g can be ignored, then the constant 
A in equation El is proportional to c^ 1 , and the implied 

value of rg is proportional to . Measuring n red- 
shifts drawn from this selection function determines fj, 
to a precision a^xjn 1 ! 2 and a 2 cl to a relative precision 
2 1/>2 /(n — l) 1 / 2 . Excluding interlopers with z < 1, we 
have measured roughly 800, 700, and 80 redshifts for 
galaxies in the LBG, BX, and BM samples, and the se- 
lection function width is <7 sc i ~ 0.3 for each. The relative 
uncertainty in <t sc i is therefore approximately ~ 3% for 
the LBG and BX samples and ~ 9% for the BM sample, 
which implies ~ 2% uncertainty in ro for the LBG and 
BX samples and ~ 5% uncertainty in the BM sample. 

3 The dependence of the integral constraint correction on ro 
means that errors in N(z) alter the inferred value of 7 as well. We 
will neglect this small effect. 



Variations of the selection function from one field to 
the next (owing, for example, to differences in the depth 
of the data or to systematic errors in our photometric 
zero points) are another source of concern, especially at 
the redshifts z ~ 2 where galaxies' colors are insensitive 
to redshift and small color errors mimic large redshift 
differences. Suppose for simplicity that all fields have 
the same number of photometric candidates, let the rms 
width of the selection function in the ith field be written 
ai = (1 + £j)s, where s is the mean width among all 
fields, and let the mean redshift of the selection function 
be written fj, + sSi where /z is the mean redshift among all 
fields. Then the rms width of the total selection function 

[Var(z)] 1/2 = s[l + Var(tf) + Var(e)] 1/2 , (24) 

exceeds the value s that should be used in determining 
A. Since we will (by necessity) use the total selection 
function in estimating ro, our estimates will be biased 
high. Systematic errors in our zero points are unlikely to 
be larger than Am = 0.05, and variations in photometric 
depth will at most change our characteristic color uncer- 
tainties from ~ 0.1 to ~ 0.2 magnitudes. The measured 
variations in galaxy redshift with U n GlZ color (see, e.g., 
Adelberger et al. 2004) imply (a) that zero point errors 
with Am = 0.05 will shift the mean redshifts of galaxies 
that satisfy the LBG, BX, and BM selection criteria by 
Az ~ 0.01, 0.13, and 0.11, respectively, and (b) that in- 
creasing the photometric uncertainty from a m = 0.1 to 
a m = 0.2 will increase the widths of the LBG, BX, and 
BM selection functions by ~ 10, 20, and 20%. The upper 
limits on Var((5) and Var(e) are therefore 0.002 (0.2) and 
0.01 (0.04), respectively, for the LBG (BX,BM) sample. 
The required reduction in ro is negligible for the LBG 
sample but could be as large as ~ 7% for the other two. 
We will account for uncertainties in the selection func- 
tion by decreasing the best-guess value of ro for the BM 
and BX samples by 3.5% and increasing the uncertainty 
in quadrature by 0.035ro. 

Contaminants 

As figure shows, some fraction of the objects in the 
BX and BM samples will be low redshift interlopers. 
We correct for the resulting dilution in the clustering 
strength by using the full selection function, starting at 
z = 0, in our estimate of ro from equation^] This is 
the optimal correction only if the interlopers have the 
same comoving correlation length ro ~ 4/i _1 Mpc (see 
below) as the galaxies in the primary samples. This 
should be nearly true, since Budavari et al. (2003) esti- 
mate ro = 4.51 ± 0.19/i -1 Mpc for the blue star-forming 
galaxy population at z ~ 0.2 from which our interlop- 
ers are drawn. In any case, since the correction itself is 
small — eliminating the tail with z < 1 from N(z) alters 
the inferred values of ro for the BM and BX samples by 
only ~ 10% — errors in it should not have an appreciable 
effect on our estimates of ro . 

3.2. Redshift clustering 

We face three significant obstacles in trying to estimate 
the clustering strength from the spectroscopic catalogs. 

(1) The objects in a given field that were selected for 
spectroscopy were not distributed randomly across the 
field, but were instead constrained to lie on one of a small 
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number of multislit masks. Since only a small fraction of 
the galaxies were observed spectroscopically in the typi- 
cal field, the finite size of the masks coupled with the need 
to avoid spectroscopic conflicts produced significant arti- 
ficial clustering in the angular positions of sources in the 
spectroscopic catalog. The effect was worsened in some 
fields by our decision to obtain particularly dense spec- 
troscopy near background QSOs. (2) Because galaxies' 
U n G1Z colors change slowly with redshift near z ~ 2, the 
expected redshift distribution N(z) of our BM and BX 
color-selected samples depends sensitively on the quality 
of the photometry. The larger color errors from noisy 
photometry will lead to a broader N(z), while relatively 
small systematic shifts in the photometric zero points 
can significantly alter the mean of N(z). Adelberger et 
al. (2004) and § I3.1.0l of this paper discuss this point in 
more detail, but the upshot is that we cannot estimate 
the selection function N(z) with great precision for the 
BX and BM samples. (3) Peculiar velocities and red- 
shift uncertainties render imprecise our estimate of each 
galaxy's position in the z direction. This limits the ac- 
curacy of our estimate of the distance from one galaxy 
to its neighbors, complicating our efforts to measure the 
correlation function on small spatial scales. 

Effects (l)-(3) are usually compensated with the aid 
of detailed simulations. Although this approach should 
work in principle, in practice it is hard for outsiders to 
evaluate whether the simulations were flawed. The re- 
mainder of this section describes the alternate approach 
that we adopt. It is based on analyzing observable quan- 
tities that are not affected by systematics (l)-(3). 

The spurious angular clustering signal can be elimi- 
nated if we take the angular positions of spectroscopic 
galaxies as given and estimate the clustering strength 
solely from their redshifts. Let Z be the comoving dis- 
tance to redshift z, and let R = (1 + z)Da(z)® be the 
transverse comoving separation implied by the angular 
separation between a galaxy and some reference po- 
sition, e.g., the center of the observed field. According 
to elementary probability identities, if we know that one 
galaxy has position (R2, Z2 ), then the probability that a 
second galaxy at transverse position Ri has radial posi- 
tion Z\ is 



P{Z 1 \R{R, 2 Z 2 ) 



P(ZiRi|Z 2 R2) 
| oc rf^P(^R 1 |Z 2 R 2 ) 



(25) 



and the expected distribution of radial separations Z\ 2 = 
Z\ — Z 2 for galaxies with transverse separation R\ 2 = 
|Ri-R 2 |is 

nOC 

P(Zu\R u ) = / dZ 2 P(Z 2 \R 12 )P(Z 12 \Z 2 R 12 ) (26) 
Jo 

~[1 + £(R 12 ,Z 12 )] x 

P(Z 2 )P(Z 2 + Z 12 ) 



dZ 9 . 



l + P(Z 2 )r2R\rP{l) 



(27) 



where we have used results from the previous section 
and adopted the shorthand /3(7) for the beta-function 
given above, f3{^) = 5(1/2, (7 - l)/2). (Equations E3 
through \H\ assume that the quantity (1 + z)Da(z) is 
constant with redshift, an approximation that is valid 
for the small separations \Z\ 2 \ ^ 40/t" 1 comoving Mpc 
between the galaxy pairs we will use in this analysis.) 



Equation 1271 shows that the observable quantity 
P(Zi 2 \R\ 2 ) is sensitive to the clustering strength but in- 
dependent of angular variations in the spectroscopic sam- 
pling density. Unfortunately the correlation function £ 
can be estimated from P(Zi 2 \Ri 2 ) with eauationl27lonlv 
if we have a reasonably accurate estimate of the selec- 
tion function shape P(Z). This can be seen more clearly 
by Taylor-expanding the integral in equation 1271 around 
Z\ 2 = and approximating the selection function as a 
Gaussian with standard deviation a sc i that is centered 
many standard deviations from z — 0. One finds 



P{Z 12 \R 12 )~ l + £(i?i 2 ,Z 12 ) 



where 



£ + ^r/ 12 + ^kf 12 (28) 



due~ u f n (u)/g{u,R 12 ), (29) 



fo = 1, h = u 2 - 1, h = u 4 - 6u 2 + 3, g = 2ir + 
(27r) 1 / 2 a(i? 12 )e-" 2 / 2 /cr scl and a = r^ 12 " 7 /3( 7 ). The co- 
efficients A n all have similar sizes since the integrals are 
dominated by contributions from \u\ ?S 1 where the inte- 
grands are of the same order. 

In the angular clustering case above, inaccuracies in 
the adopted width cr sc i of the selection function affected 
the inferred amplitude of the correlation function but not 
its shape. Here they affect both. Moreover errors in a se \ 
are multiplied not by £, but by 1 + £, which implies that 
they can easily dominate the true clustering signal when 
£ <C 1. Equation 1281 shows that one must be careful 
estimating the strength of redshift clustering when the 
shape of the selection function is poorly known. 

Our solution exploits the fact that ro <C ow for our 
survey, which implies Z\ 2 <C er SG i for all separations where 
£ is large enough to measure. As long as Z\ 2 -C cr se i, the 
terms proportional to Z\ 2 and Z\ 2 can be neglected in 
equation 1281 and P{Z\i\R\%) will be very nearly equal 
to C(i?i 2 )[l + £(i?i 2 , Z\ 2 f\, with C a function that does 
not depend on Z\ 2 . The function C does depend on the 
unknown selection function, but it can be eliminated by 
taking ratios of pair counts in a manner we discuss be- 
low. Ratios of P{Zi 2 \Ri 2 ) at fixed R\ 2 and different Z\ 2 
will therefore be the basis of our estimate of the cluster- 
ing strength in the spectroscopic sample; they are nearly 
immune to systematics from the irregular spectroscopic 
sampling and from the unknown selection function shape. 

The final complication is the significant uncertainty <jz 
in each object's radial position Z from peculiar veloci- 
ties and redshift uncertainties. This uncertainty can be 
treated in various ways. We will follow the standard ap- 
proach and estimate the value of the correlation function 
only within bins whose radial size AZ is large compared 
to a z . 

We are now ready to present the estimator that we 
adopt. Letting N(ai, a 2 , R) denote the observed number 
of galaxy pairs with transverse separation R and redshift 
separation a± < \Z\ 2 \ < a 2l the discussion of the preced- 
ing paragraphs shows that the expected total number of 
pairs with radial separation a\ < \Z\ 2 \ < a 2 (and any 
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transverse separation) is 

pairs 

(N tot ( ai ,a 2 )) = 2(o a -oi) C{Rij){l + Z aita2 (Ri. 

i>j 



where 



(R, 



1 



a-2 - ai J ai 
As long as iV to t is large enough that 

/ #tot(&l,&2) \ 



(30) 



dZ£(i?„,Z). (31) 



(JVtot (61,62)) 



\ iV tot (ai, a 2 ) + A r tot(6i, 62) 
the ratio of pair counts 



(iV to t(ai,a2) + iVtot(6i,62)) ! 

(32) 



K. 



61,62 — 



JVtot(6i,62) 



iVtot(oi, 
will have expectation value 



a.2) 



•iV to t(6i,6 2 ) 



61,62 \ , 



^Vexp(6l,6 2 ) 



N eKp (ai,a 2 ) + AT CX p(6i,6 2 ) 



N^ p (ai,a 2 ) 



N' 

exp 



(61,6a 



(33) 

(34) 
(35) 



regardless of angular selection effects, of uncertainties 
in the selection function, 4 of peculiar velocities, and 
of redshift measurement errors, provided a 2 <C cr se i, 
62 < CTsci, a 2 — ai > crz, 6 2 — 6i > (Tz, the selection 
function does not have strong features on scales smaller 
than <j se T, and (1 + z)Da(z) varies slowly with z. Here 
N cxp (ai,a 2 ) = (N tot (a 1 ,a 2 )) is given by equation EH and 



^exp(ai, a 2) = 2(a 2 



pairs 

E 



The second approximate equality in eauation l35l exploits 
the fact that C(Rij) is a very weak function of Rij in 
realistic situations. We estimate the correlation function 
from the spectroscopic sample by finding the parameters 
required to match the observed ratio K^'°?. In principle 
K^ 1 '^? could be calculated separately for pairs in different 
bins of transverse separation Rij , producing an estimate 
of the function -K^*'a 2 2 (-R) and allowing one to estimate 
both To and 7 from the data. In practice a much larger 
sample is needed to fit for both r$ and 7, so we hold 7 
fixed and estimate ro only. Fortunately, as we will see, 
the best fit value of r hardly changes as 7 is varied across 
the range allowed by the galaxies' angular clustering. 

The dependence of this estimator on the clustering 
strength is easy to understand intuitively. If the galax- 
ies were unclustered (£(r) = 0), we would observe the 
same number of pairs at every separation and -K^ „ 
would be equal, on average, to the ratio of the bin sizes 
?y = (6 2 — 6i)/(6 2 — 61 + a 2 — a±). Correlation functions 
that peak near r = will produce more pairs in bins at 

4 More correctly, (K) is independent of uncertainties in the se- 
lection function width. The expectation value (K) will be affected 
by errors in the mean redshift of the selection function if these er- 
rors are large enough to significantly alter the mapping of redshifts 
and angles to distances. 
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/ arcsec 



Fig. 3. — Angular correlation functions uncorrected for the 
integral constraint X. Points with error bars show the measured 
values of the estimator cj£,<j for each sample. The dashed line 
shows the power-law w = A6~P that fits the data best. The solid 
histogram indicates the number of galaxy pairs (/1000) in each 
angular bin. 



smaller separations, driving K^'^ 2 away from 77. The dif- 
ference between K^ 1 ^ 2 and 77 is sensitive to the strength 
of the clustering, and therefore can be used to estimate 
it. Adelberger (2005) uses Monte Carlo simulations to 
analyze the behavior of K^'^ in more detail. 

4. RESULTS 
4.1. Angular 

Figure shows the raw (integral-constraint correction 
1 = 0) values of the Landy-Szalay estimator u>ls (equa- 
tion 1101) as a function of angular separation for galax- 
ies in the three samples. We limited these data, and 
our subsequent fits, to angular separations 6 < 200", 
since at larger scales the weak angular-clustering sig- 
nal could be swamped by various low-level systemat- 
ics. The uncertainty cr, in each bin was taken to be 
the larger of (L>A) 1/2 / \RRi) (Peebles 1980, §48) and 
the observed standard deviation of the mean of ujls{6i) 
among the different fields in the survey. Typically the 
two were comparable. Numerical % 2 minimization pro- 
duced the power-law fits shown with dashed lines. The 
correlation function parameters implied by the LBG fit, 
r Q = 3.35 ± 0.20/1- 1 comoving Mpc, 7 = 1.74 ± 0.1, 
agree well with the estimates of Giavalisco & Dickinson 
(2001) which also assumed T = 0. It is clear, however, 
that these parameters cannot be correct. Substituting 
them into equations EH 1221 an d El shows that a signif- 
icant correction X should have been applied to account 
for fluctuations on scales larger than the field-of-view. 
(Porciani & Giavalisco 2002 reached a similar conclusion, 
and derived a result for LBGs that agrees well with the 
integral-constraint-corrected result we present below.) 

Figure 0] shows how our best-fit estimates of ro and 7 
change as the correction X is applied. In our first itera- 
tion, described above, we assumed 1 = and calculated 
the correlation function lo\(&). For the second iteration 
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Fig. 4. — Effect of the integral constraint correction X on the 
inferred correlation function. The correction depends on the clus- 
tering strength and is therefore unknown initially. The figure illus- 
trates the iterative approach described in the text. Small circles 
show the adopted value of X and the implied value of ro at each 
stage in the iteration. The numbers above points indicate the value 
of 7 for that iteration; for clarity they are given for only some it- 
erations. Adopting the initial guess 1 = leads to values of ro,7 
shown at X = on the plot. Recalculating X for these values of 
ro,7 leads to the revised estimate of the correlation function pa- 
rameters shown by the point just above and to the right. Follow 
the arrow. These new parameters require another update to X, 
which in turn requires a further adjustment to ro and 7. The pro- 
cess monotonically approaches the point of convergence marked by 
a large circle enclosing a cross. The same final value is reached 
if the initial guess at X is too large. In general larger corrections 
X imply larger correlation lengths ro and shallower slopes 7. The 
±lo" uncertainty in X (dark bar in each panel) is comparable to X 
itself, and is a major contributor of our total uncertainty. 



we assumed the value of X implied by uo\ (equations 1161 
123 122 and !21|) and estimated 0^2(6) ■ For the third itera- 
tion we calculated X from u>2 (9) ■ The process continued 
in this way until convergence. It settled on the same final 
parameters if we initially assumed a value for X that was 
too large. As figure 0] shows, the applied integral con- 
straint corrections were comparable for each of the three 
samples. This is because the increase in X implied by the 
longer correlation lengths at lower redshifts happened to 
be cancelled by a decrease in X that resulted from the 
lower-redshift samples' greater comoving depths. 

To check the plausibility of our adopted values for X, 
we inserted into equation 1181 the best power-law fits to 
Lo(0) from each sample's final iteration. The equation re- 
turned 0.008, 0.006, and 0.009 as the empirical estimates 
of X for BM, BX, and LBG samples. These values dif- 
fer somewhat from the ones we adopt (figure H}, because 
the empirical and CDM approaches (see l3.1.0*)l make dif- 
ferent assumptions about the behavior of u) (9) on the 
scales 9 > 200" where we cannot measure it, but they are 
consistent within their large la uncertainties and small 
changes (< la) to the best-fit values of 7 would make 
them agree perfectly. Readers may also be reassured to 
recall that our estimates of ro and 7 agree well with those 
of Porciani & Giavalisco (2002), who corrected for the in- 
tegral constraint in a completely different way. 



We estimated the random uncertainty in ro and 7 in 
two ways. First, we analyzed many alternate realiza- 
tions of our uj(9) measurements that were generated un- 
der the assumption that the uncertainties were uncorre- 
cted. To create a single alternate realization, we added 
to each measured value u(9i) a Gaussian random devi- 
ate with standard deviation equal to its uncertainty at. 
After creating numerous alternate realizations, we cal- 
culated and tabulated the values of ro and 7 implied by 
each. Our la confidence interval on ro was defined as the 
range that contained 68.3% of the measured values of ro 
among the alternate data sets. The 7 confidence interval 
is defined in the same way. We found ro = 4.0 ± 0.2ft -1 
Mpc, 7 = 1.57 ± 0.07 (LBG), r = 4.3 ± 0.2ft -1 Mpc, 
7 = 1.59 ± 0.04 (BX), and r = 4.7 ± 0.2ft -1 Mpc, 
7 = 1.55 ± 0.06 (BM). These numbers assume uncorre- 
lated error bars and neglect the uncertainty in our selec- 
tion functions. The uncertainty in X is also neglected, 
since each alternate realization had the same integral 
constraint correction. 

Our second approach was to extract random subcat- 
alogs from our full galaxy catalog, estimate ro and 7 
for each with the iterative solution for uj (9) described 
above, measure how the r.m.s. dispersion in best-fit pa- 
rameter values depended on the number of sources in the 
subcatalog, and extrapolate to the full catalog size. In 
fact we created our random subcatalogs in pairs, with 
both subcatalogs in a pair containing a random fraction 
/ < 0.5 of the sources in the full catalog and no sources 
in common between them, and estimated the uncertainty 
in ro at a given value of / as 2 -1 / 2 times the r.m.s. 
difference in ro among pair members. This prevented 
us from underestimating the random uncertainty in ro 
as / — > 0.5, when random subcatalogs could otherwise 
contain nearly the same galaxies. With this approach 
we estimate r = 4.0 ± 0.5ft -1 Mpc, 7 = 1.57 ± 0.12 
(LBG), r = 4.3 ± 0.3ft -1 Mpc, 7 = 1.59 ± 0.05 (BX), 
and r = 4.7± 0.5ft -1 Mpc, 7 = 1.55 ±0.07 (BM). These 
numbers neglect the uncertainty in our selection function 
and do not fully account for the uncertainty in X. They 
do not assume uncorrelated error bars, however, and we 
will therefore assume that they are more accurate than 
the numbers from the preceding paragraph. 

The uncertainty in X is not negligible. According to 
eauationll5lthe la uncertainty in X is ~ 35-50% as large 
as X itself for our 21 fields. As X varies over its la al- 
lowed range, the best-fit parameters ro and 7 change 
by roughly as much as the uncertainties quoted above. 
Adding these changes in quadrature to the random un- 
certainties above, and making the minor corrections for 
the selection function uncertainties discussed in § 13.11 
we arrive at the following estimates: ro = 4.0 ± 0.6ft -1 
Mpc, 7 = 1.57 ± 0.14 (LBG), r = 4.2 ± 0.5ft -1 Mpc, 
7 = 1.59 ± 0.08 (BX), and r = 4.5 ± 0.6ft -1 Mpc, 
7= 1.55 ±0.10 (BM). 

Other investigators (e.g., Giavalisco & Dickinson 2001; 
Foucaud et al. 2003) have claimed that at redshift 
z ~ 3 bright galaxies cluster more strongly than faint 
galaxies. Our data support this conclusion. Figure El 
shows that in the BX and LBG samples the correlation 
lengths of galaxies with 23.5 < K < 24.75 (r ~ 5.0ft -1 
Mpc) exceed those of galaxies with 24.75 < 1Z < 25.5 
(ro ~ 3.7ft -1 Mpc) by a significant amount. If we split 
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Fig. 5. — Correlation length ro for bright and faint subsamples 
of the BM, BX, and LBG samples. 



the BX and LBG samples into two halves at random, 
rather than by apparent magnitude, the difference in cor- 
relation lengths between the two halves is this large only 
about 5% (BX) to 24% (LBG) of the time. The situa- 
tion is less clear for the BM sample at z ~ 1.7, where the 
uncertainties are larger owing to the poor determination 
of N{z) from the small number of measured redshifts, 
but the data do not seem to suggest stronger clustering 
for brighter galaxies. On the one hand it makes sense 
that UV-brightness should become less associated with 
strong clustering as redshift decreases, since UV-bright 
galaxies are known to be weakly clustered at z = and 
z = 1. On the other, the overall clustering of the BM 
sample is still quite strong, stronger than one expects for 
typical collapsed objects at z ~ 1.7 (see below), and so it 
seems that the numerous objects too faint to satisfy our 
selection criteria must be less clustered than the bright 
objects in our sample. We will wait for additional spec- 
troscopic observations of BM galaxies before commenting 
further. 

The dependence of clustering strength on luminosity 
can produce a false impression of a change in ro with 
redshift, since lower redshift samples will tend to reach 
fainter absolute luminosities. Truncating the samples at 
a fixed absolute luminosity does not seem a good solu- 



tion to us, however, since the bright end of the UV lu- 
minosity function rises rapidly towards higher redshifts 
(e.g., Adelberger & Steidel 2000) and one would there- 
fore be comparing rare objects at lower redshifts to com- 
mon objects at higher redshifts. A better approach is to 
compare galaxy samples of roughly the same comoving 
number density. Since selection with a constant appar- 
ent magnitude limit 1Z < 25.5 happens to produce simi- 
lar comoving number densities for the three samples (see 
equations 1391 and 1401 and the related discussion), we will 
continue to use the constant apparent magnitude limits 
of equation H for our samples in the remainder of the pa- 
per. Readers should be aware that the reported value of 
each sample's correlation length is somewhat arbitrary 
for this reason. It reflects the characteristics of the sam- 
ple as defined here, not of the general galaxy population 
at high redshift. 

4.2. Redshift 

For our estimator of the redshift clustering strength 
we took K^q^q, the ratio of the number of galaxy pairs 

with comoving radial separation < |Z|/(ft -1 Mpc) < 
20 to those with comoving radial separation < 
|Z|/(ft -1 Mpc) < 40. Since 20ft -1 Mpc is significantly 
larger than the uncertainty in each galaxy's radial po- 
sition (az — 300kms~ 1 (l + z)/H(z) ~ 3ft -1 comoving 
Mpc), and since 40ft. -1 Mpc is significantly smaller than 
the selection functions' widths (ct sc i > 200ft -1 comoving 
Mpc), the expected value of K^q^o should be given by 
equation[!21 We limited our analysis to pairs with trans- 
verse separations 6*^ < 300", equivalent to R^ ^ 5.9ft. -1 
comoving Mpc at z = 2.5, to reduce the sensitivity of 
our results to any deviations of the correlation function 
from a 7 = 1.55 power-law on large scales. 

Only the BX and LBG spectroscopic samples were 
large enough to allow meaningful measurements of 
K®q4q. For 7 = 1.55, the right-hand side of equation I3H 
is equal to the observed ratio K^q^q when ro = 4.6ft. -1 

(LBG) or 4.5ft -1 (BX) comoving Mpc. The values 
change by roughly ±2% as 7 is varied from 1.45 to 1.65. 
When analyzed with this technique, mock galaxy cata- 
logs from the GIF simulation fS I2.2(l with sizes similar to 
our observed catalogs show a ±ler dispersion in ro around 
the true mean of 0.6ft -1 (LBG) and 0.9ft -1 (BX) comov- 
ing Mpc, so we adopt 4.6 ± 0.6ft -1 and 4.5 ± 0.9ft -1 Mpc 
as the best fit values to ro for our spectroscopic catalogs. 
The results do not change significantly if we eliminate 
pairs with 9ij < 60" from the analysis, showing that 
we have measured genuine large-scale clustering and not 
merely the clumping of objects within individual halos. 

Figure presents the result in a more graphical way. 
We divided our lists of galaxy pairs into bins according 
to transverse separation Rij , then calculated K^^q sep- 
arately for each bin. Points with error bars show the val- 
ues we found. The solid lines show the values predicted 
by the 7 = 1.55 correlation function described in the 
preceding paragraph. The plot shows that the derived 
correlation function parameters provide a reasonable fit 
to the clustering of the galaxies in the spectroscopic sam- 
ple. 

4.3. Summary 
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Fig. 6. — Redshift clustering in the BX and LBG samples. The 
ordinate shows K^^g , the fraction of galaxy pairs with comoving 
radial separation < Z < i0h~ l Mpc that have < Z < 20h~ 1 
Mpc. K^OiO was calculated for galaxy pairs in different bins of 
transverse separation R. In the absence of clustering K^q^o wou ld 
be equal to 0.5, on average, at all values of R. Instead our data 
(points with error bars) show that K^lo * s g enerau y larger than 
0.5, especially at smaller R. This implies a positive correlation 
function that rises towards smaller separations. The fractions at 
the top of the plots show the number of galaxies with < Z < 
20k' 1 Mpc and < Z < 40h~ 1 Mpc in each bin of R. The 
error bars are equal to n 1/l2 /d where n is the numerator and d 
the denominator in the fraction. They are only approximate; the 
actual analysis treated uncertainties as described in the text. The 
solid line shows the values of K^q^o implied by the best fit of a 

power-law correlation function £(r) = (r/ro)~ 1,55 to the data when 
they are placed in a single radial bin. 



We presented two independent estimates of the corre- 
lation function for each of our galaxy samples. The es- 
timates were consistent with each other, but the first, 
based on the galaxies' angular clustering, had some- 
what smaller uncertainties. This resulted from the larger 
size of the photometric sample, and was accentuated by 
the serious systematics in the spectroscopic sample that 
made us throw much of our data away. We will adopt the 
angular clustering results for the remainder of the paper. 

5. IMPLICATIONS 

5.1. Correspondence to halos 

On small scales, smaller than roughly the typical radius 
r v ; r of a virialized halo, the spatial clustering of galax- 
ies is difficult to predict or interpret. It depends on the 
ease with which nearby galaxies merge with each other, 
on the ability of a galaxy to maintain its star-formation 
rate as it orbits within a larger potential well, on the pos- 
sible impact of a galaxy's feed-back on its surroundings, 
and so on. On larger scales these baryonic complications 
have little effect and the correlation function of galaxies 
should be virtually identical to the correlation function 
of the halos that host them. To see that this is true, 
consider the galaxy correlation function in a simplified 
situation where every galaxy is associated with a single 
halo and the probability that a galaxy lies a distance r 



from its halo's center is f(r). In this case the galaxy dis- 
tribution will be a Poisson realization of the continuous 
density field p g (r) that is created when the discrete halo 
distribution is convolved by /, and the galaxy correlation 
function £ g will be equal to the correlation function of p g 
(see, e.g., Peebles 1980, eq. 33.6). Since the halo pow- 
erspectrum is P/j(k) = l/n + / d 3 r£i l exp(ik • r) (Peebles 
1980, eq. 41.5), where £/, is the halo correlation function 
and n is the halo number density, since the powerspec- 
trum of p g is equal to |/(k)| 2 P/j(k), where / is the Fourier 
transform of /, and since the powerspectrum and corre- 
lation function are Fourier-transform pairs, the galaxy 
and halo correlation functions will be related through 

e a (r)=P(r)/n + F(r)®^(r) (37) 

where ® denotes convolution and F = f ® f . Now 
/(r) = when r is greater than some maximum sepa- 
ration r max ~ r V j r ; galaxies cannot be located arbitrarily 
far from the center of their halo. The first term will 
therefore be zero for r 3> 2r V j r . The second term will be 
almost exactly equal to £h( r ) at the same large separa- 
tions, because plausible correlation functions do not not 
change significantly from r to r + 2r v ; r when r v ; r <C r. 
This shows that £, g (r) ~ £/i( r ) f° r large r. Although it 
was derived for a simplistic model, the result is more gen- 
eral. As long as galaxies are associated in some way with 
dark matter halos, and as long as there is some maximum 
separation r max ~ r vlI between each galaxy and the cen- 
ter of its halo, galaxies will have the same correlation 
function as the halos that host them for r 3> 7Vi r . We 
will accordingly focus our attention solely on separations 
r ^ lh^ 1 comoving Mpc that are many times larger than 
the typical virial radius. 

Readers who are skeptical of the claimed similarity be- 
tween galaxy and halo correlation functions at r r V j r 
may wish to consider figure which compares observed 
galaxy correlation functions at redshifts z — 0.2, 1.0, 3.0 
to the halo correlation functions in the GIF-LCDM sim- 
ulation outputs at the same redshifts (see § I2.2fl . The 
agreement is excellent. 

The implied association of galaxies with massive po- 
tential wells is hardly surprising. The interesting result 
is the characteristic mass of the virialized halos that con- 
tain the galaxies. This can be estimated since more mas- 
sive halos cluster more strongly. Figure |S] compares the 
correlation functions at 1 ^ rjihr 1 comoving Mpc) ^ 10 
for galaxies in the BM, BX, and LBG samples to the cor- 
relation functions of virialized dark matter halos above 
various mass thresholds in the GIF-LCDM simulation 
(see S 12.2(1 . The agreement is best if galaxies in the BM, 
BX, and LBG samples are associated with halos of mass 
M ~ 10 121 M Q , 1O 12 O M , and 10 115 M Q , respectively. 

Uncertainties in our measured correlation functions 
lead to uncertainties in the estimated characteristic 
masses. The size of these uncertainties can be gauged 
most cleanly by comparing our measured angular correla- 
tion function to the angular correlation functions of halos 
above different mass thresholds. We calculated halo an- 
gular correlation functions numerically from equation 
after substituting in the observed redshift distributions 
for the different samples (fig. P) and the GIF-LCDM halo 
correlation functions at the mean redshift of each sample. 
Typical results are shown in figure |5J The halo angular 
correlation functions fall significantly below the data on 
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Fig. 7. — Comparison of observed galaxy correlation functions 
to the expected correlation functions of dark matter halos. Solid 
lines show the power-law fits implied by the parameters reported 
by Budavari et al. (2003; r = 5.77±0.09, 7 = 1.84±0.02, 2 ~ 0.2), 
by Coil et al. (2003; r = 3.19 ± 0.51, 7 = 1.68 ± 0.07, z ~ 1.0), 
and by us (§0 z ~ 3.0). In each sample the galaxies' clustering 
was measured out to a maximum transverse separation of roughly 
10/i — 1 comoving Mpc. Circles show t he co rrelation function of 
halos in the LCDM-GIF simulation (see § 12,21 , The halo correlation 
functions were calculated at the redshift indicated in each panel, 
and only halos with masses larger than the indicated threshold 
were included. The comparatively small number of halos with M > 
10 13 Mq is responsible for the noisier halo correlation function at 
z = 0.2. In general the agreement between the galaxy and halo 
correlation functions is excellent, although differences between the 
galaxy and halo correlation functions arise as r — > 2h~ Mpc and 
z — > because r is no longer enormous compared to the halo radius. 



small scales, since (by definition) a halo cannot have an- 
other halo as a neighbor within the radius r v ; r . As argued 
above, however, it is the larger scales r ^ l/i -1 comov- 
ing Mpc (i.e., 9 1') that are relevant for comparing to 
galaxies, and here the agreement is good. The uncer- 
tainty in the galaxies' implied mass scale is dominated 
by the uncertainty in the integral constraint correction X 
that was applied to the data, since this moves all points 
up or down together. Rough la limits on the threshold 
masses of the hosting halos are 
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< M/Mq 
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(38) 



Readers unfamiliar with the idea of threshold masses may 
wish to see footnote 6 in §G\ below, for further discussion 
of how to interpret them. 



Fig. 8. — Our best correlation function fits (solid lines) compared 
to the correlation functions of virialized dark matter halos in the 
GIF simulation. The halo correlation functions were calculated 
for halos with mass M > 10 114 Af Q , M > lO 12 o Af , and M > 
10 12 6 M Q that were identified at redshifts z = 1.77 (BM), 2.32 
(BX), and 2.97 (LBG). The dependence of halo clustering strength 
on mass allows one to estimate characteristic masses of the halos 
that are associated with the galaxies in our samples. 



One implication of figure is that some halos are oc- 
cupied by more than one of the galaxies in our samples; 
the data on small scales are strongly inconsistent with the 
correlation functions that assumed one object per halo. 
(This was first pointed out by Wechsler et al. (2001) 
and later denied by Porciani & Giavalisco (2002). Our 
analysis agrees far better with that of Wechsler et al.) 
The fraction of LBGs that reside in the same halo as an- 
other LBG can be calculated as follows. If there were 
never more than one LBG per halo, the expected num- 
ber of LBGs within 1' of a randomly chosen LBG would 
be Ni = Af(l + u>i) where A/" is the surface density of 

LBGs, u>i = J 9d6ujh{9)/ J Q 9d6, and uih is halo angu- 
lar correlation function that fits our data best for 9 > V . 
The actual number of LBGs within 1', At rU c, is given by 
the same equation with the halo correlation function tOh 
replaced by the galaxy correlation function Wt rU c- The 
expected number of additional LBGs in a halo that is 
known to contain one LBG, /lbGj is equal to the dif- 
ference between A truc and N±. Numerically integrating 
the angular correlation functions for observed galaxies 
and simulated halos, and multiplying by the LBG sur- 
face density from tabled we estimate /lbg ~ 0.05. The 
numbers for the other samples, estimated with a similar 
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we estimate 



n BX = (6 ± 3) x lCT 3 /i 3 Mpc~ 3 
n BM = (5 ± 2.5) x l(T 3 /i 3 Mpc~ 3 



(40) 



200 



Fig. 9. — Observed galaxy angular correlation functions com- 
pared to the angular correlation functions of halos in the GIF- 
LCDM simulation. Points with error bars show our observations, 
including integral constraint correction X ~ 0.01. (Figure HI shows 
the precise values of X that were adopted.) Solid lines show the 
halo angular correlation functions for GIF-LCDM halos of mass 
M = 10 119 , lO 12 - 3 A/ at the identification redshift z = 1.77 
(BM), of mass M = 10 11 ' 8 , 10 12 2 M Q at the identification redshift 
z = 2.32 (BX), and of mass Al = 10 112 , 10 118 M Q at the identifi- 
cation redshift z = 2.97 (LBG). The angular correlation functions 
were calculated for the redshift distributions shown in figure [3 In 
each case the curves for the more massive halo are on top. The 
plateau in the halo angular clustering at small radii results from 
the definition of a halo, which imposes a minimum separation for 
halo neighbors by counting two halos as the same object if they lie 
too close together. Dotted lines show the angular correlation func- 
tions that result when the power-law fits to the three-dimensional 
halo correlation functions at 2 < r < I0h~ 1 comoving Mpc are 
extrapolated to r = 0. They are for reference only; the galaxy 
and halo angular correlation functions should be compared only 
at the separatio ns 6 Si 60" where they are expected to be simi- 
lar (see equation 1371 . The two halo curves in each panel roughly 
span the range in angular clustering strength allowed by the uncer- 
tainty of ~ 0.005 in X, and so the masses of the halos listed above 
should roughly bracket the allowed masses of the halos that host 
the galaxies. 



approach, are /bx ~ 0.25 and /bm ~ 0.25. Some galax- 
ies must share the same halo to explain the data, but 
the required number is small. Note that we are referring 
solely to galaxies that satisfy our color and magnitude se- 
lection criteria; we obviously cannot say anything about 
the spatial distribution of galaxies that are not in our 
samples. 

Having established a rough characteristic mass for the 
halos that contain our galaxies, we can compare the 
galaxy and halo number densities and estimate what frac- 
tion of the most massive halos at redshifts 1.5 < z ^ 3.5 
do not contain a galaxy that is detectable with our tech- 
niques. According to Adelberger & Steidel (2000) the co- 
moving number density of LBGs brighter than 1Z = 25.5 
is 

n LBG = (4 ± 2) x 10~ 3 /i 3 Mpc- 3 (39) 

for n M = 0.3, il A = 0.7. Combining the BM and BX 
completeness coefficients in table 3 of Adelberger et al. 
(2004) with the surface densities in table^of this paper, 



where the assigned uncertainties of 50% are approximate 
guesses that will be refined later with Monte Carlo sim- 
ulations. Number densities for the population of halos 
that contain the galaxies can be estimated from the GIF- 
LCDM simulation given the range of halo masses (equa- 
tion |2Hl that are compatible with the galaxies' cluster- 
ing strength. Figure lTni shows that the number density of 
galaxies in the BM, BX, and LBG samples is comparable 
to the number density of halos that can host them. As we 
will discuss in §0 below, this implies that the duty cycle 
of star-formation in the galaxies must be of order unity 
and shows that our surveys cannot be severely incom- 
plete. Similar arguments have been made by Adelberger 
et al. (1998), Giavalisco & Dickinson (2000), Martini & 
Weinberg (2001), and others. 

5.2. Evolution 

The spatial distribution of a population of galaxies 
evolves in an easily predictable way as it responds to 
the gravitational pull of dark matter. We used a simple 
approach to estimate this evolution from the LCDM-GIF 
simulation. After connecting the observed galaxies to ha- 
los with a range of masses (equation I38|) , we measured 
the evolution in the clustering of those halos in the sim- 
ulation and assumed that the galaxies' clustering would 
evolve in the same way. See § 12.21 Figure pj] shows the 
implied change in correlation length tq with time for the 
galaxies in our samples. By z ~ 2 galaxies in the LBG 
sample will have a correlation length similar to measured 
correlation lengths of galaxies in the BM and BX sam- 
ples. By z ~ 1 their correlation length will be similar 
to the correlation length of early- type (i.e., "absorption 
line") galaxies in the sample of Coil et al. (2003). By 
z ~ 0.2 their correlation length will be equal to the ob- 
served correlation length of ellipticals (Budavari et al. 
2003). The evolution of ro for galaxies in the BM and 
BX samples is similar. 

Figures IT21 and H~3l present a more detailed view of the 
possible relationships between the descendants of galax- 
ies in our samples and various galaxy populations at 
lower redshift. Figure IT2l shows that at z = 1 the de- 
scendants' clustering strength will significantly exceed 
that of average galaxies in optical magnitude-limited sur- 
veys. Since these surveys are dominated by star-forming 
("emission-line") galaxies, we can conclude that the typ- 
ical descendant is no longer forming stars by z ~ 1. A 
similar point was made by Adelberger (2000) and Coil et 
al. (2004) . The stronger clustering of redder and brighter 
sub-populations at z ~ 1 is more compatible with the de- 
scendants' expected clustering, but the match is best for 
the sub-population with early-type spectra. This is espe- 
cially true for descendants of the brightest galaxies in the 
BX and LBG samples. Although the observed number 

5 In principle the descendants could have emission-line spectra 
if they made up a small part of an otherwise weakly clustered 
population, but this possibility seems to be ruled out by the fact 
that the number densities of BM/BX/LBG galaxies are similar to 
the number density of galaxies in the z ~ 1 emission-line sample. 
(Our estimate of th e number density in the emission-line sample, 
shown in figure [TU1 is from A. Coil, private communication.) 
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Fig. 10. — The comoving number densities of galaxies (G) 
in various samples compared to the comoving number density of 
the halos (H) at the same redshift that have the same clustering 
strength (on large scales) as the galaxies. These are the halos that 
the galaxies can reside within. Shaded regions are for star-forming 
galaxies and their associated halos: the "blue" SDSS sample of 
Budavari et al. (2003; z = 0.2), the emission-line sample of Coil 
et al. (2004; z = 1), and our three samples (z > 1.5). The height 
of the rectangles covers the ±lcr range in number density. The 
uncertainties in halo number density follow from the uncertainty 
in their galaxy's clustering strength. At each redshift star-forming 
galaxies are roughly as numerous as the halos that can host them. 
Galaxies that are no longer forming stars tend to outnumber halos 
with similar clustering strength, which shows that several reside in 
the same halo. This is evident in both the absorption-line sample 
of Coil et al. (2004; dotted rectangles at z = 1) and the elliptical 
sample of Budavari et al. (2003; -23 < M r < -21, dotted rect- 
angles at z = 0.2). For consistency with the Budavari sample, the 
number density of early-type galaxies at z ~ 1 was calculated by 
integrating the z ~ 1 early-type luminosity function of Chen et al. 
(2003) over all magnitudes brighter than Mr = —21. 



density of early- type galaxies at redshift z ~ 1, roughly 
(7±1.5) x 10~ 3 /i 3 Mpc~ 3 (Chen et al. 2003), is consistent 
with the idea that most had a BM/BX/LBG galaxy as 
a progenitor, we cannot rule out the idea that some had 
multiple merged BM/BX/LBG progenitors and others 
had none. 

Figure ED is an analogous plot for redshift z ~ 0.2. 
Correlation lengths for various populations in the Sloan 
Digital Sky Survey (SDSS) were taken from Budavari 
et al. (2003). Number densities were calculated assum- 
ing a surveyed volume of 10 8 /i -3 Mpc 3 with an uncer- 
tainty of ~ 15% (T. Budavari 2004, private communica- 



FlG. 11. — Clustering strength versus redshift. Top panel: the 
large, lightly shaded point shows the correlation length we infer 
for galaxies in the BM sample. The arrow shows the correlation 
length these galaxies will have at lower redshifts. The top edge 
of the arrow shows the evolution of ro for halos in the LCDM- 
GIF simulation that had mass exceeding Mt op = 1O 12 ' 3 M0 at the 
redshift of identification 2;^ = 1.77; the bottom edge is for halos 
with mass exceeding Afbot = 10 11,9 Mq. Smaller, darker points 
show measurements of ro from other samples: the SDSS "elliptical" 
and blue samples of Budavari et al. (2003; z = 0.2 and ro = 6.6, 4.5 
respectively), the DEEP full and absorption-line samples of Coil et 
al. (2003; z = 1 and ro = 3.2, 6.6 respectively), and the BX and 
LBG samples of this paper. Middle panel: similar to the top panel, 
except z id = 2.32, M top = 1O 12 2 M , M bot = lO 11 ' 8 ^^, and the 
large point is for the BX sample. Bottom panel: similar to the top 



panel, except 2;<] = 2.97, Mto 



10 1 



j M , M bot = 1O 112 M , 



and the large point is for the LBG sample. 



tion). The expected z — 0.2 clustering strength of typ- 
ical LBG descendants (darker shaded box) agrees best 
with the clustering of galaxies with early-type SEDs in 
the Budavari et al. (2003) sample. The galaxies in our 
high-redshift samples are roughly as numerous as these 
early- type galaxies, though the possibility of merging 
prevents us from estimating the number density of their 
BM/BX/LBG descendants at z ~ 0.2. The descendants 
of brighter LBGs will have a correlation length closer 
to that of bright ellipticals, though there are probably 
not enough descendants to account for the entire bright 
elliptical population. 

6. SUMMARY & DISCUSSION 

The first part of the paper was concerned with measur- 
ing the spatial clustering of large samples of star-forming 
galaxies at redshifts z ~ 1.7, 2.2, and 2.9. We fit a three- 
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FlG. 12. — Clustering strength of LBG descendants and of dif- 
ferent galaxy populations in the z ~ 1 DEEP optically selected 
sample of Coil et al. (2003). Galaxies at z ~ 1 that are brighter 
(Mg < —19.75), that are redder in observed Ft — I or rest-frame 
(B — R)o, or that have early-type (absorption line) spectra cluster 
more strongly than star-forming (emission line) galaxies or than 
galaxies in the full sample ("All"). The shaded regions in the 
background show the range of correlation length at z = 1 ex- 
pected for descendants of typical (23.5 < TZ < 25.5) and bright 
(23.5 < TZ < 24.75) LBGs. The correlation len gths are similar for 
descendants of BX and BM galaxies (see figure 1111 . The galaxies 
at z ~ 1 that cluster strongly enough to be descended from BM, 
BX, or LBG galaxies have early type spectra and are red. 



5 - 



brightest bright E 



J normal E 

— $ bright 



bright blue 



Alii - 



normal blue 



0.001 



0.005 0.01 

n / (h- 1 Mpc) 3 



Fig. 13. — Correlation length and number density of LBG descen- 
dants and of different galaxy populations in the SDSS sample of Bu- 
davari et al. (2003). The darker shaded region in the background 
shows the range of clustering strength at z = 0.2 expected for the 
descendants of typical LBGs (23.5 < TZ < 25.5) and the observed 
number density of typical LBGs at z ~ 3. The lighter shaded re- 
gion above and to the left is for bright LBGs (23.5 < TZ < 24.75). 
Values for BX an d BM descendants are similar (see figure ITTl and 
equations 1391 and 141 It . Points show measured values for galaxy 
populations at z ~ 0.2. The descriptive labels on the points cor- 
respond to the sub-samples defined by Budavari et al. (2003) as 
follows: All — > full sample; bright — » -21 > M r > -22; brightest 
-+ -22 > M r > -23; normal blue -> M r > -21, t > 0.65; bright 
blue -> -21 > M r > -23, t > 0.65; normal E -» M r > -21, 
t < 0.02; bright E -> -21 > M r > -23, t < 0.02. 



dimensional correlation of the form £(r) = (r/ro) 1 to 
the galaxies' angular clustering with standard techniques 
and to the galaxies' redshift clustering with a new estima- 
tor. The new estimator, K, is insensitive to many of the 
possible systematic biases in our spectroscopic surveys. 
We reached consistent conclusions about the correlation 



function with the two approaches, but adopted the an- 
gular results since their random uncertainties were some- 
what smaller. As given in § 14.11 the best-fit correlation- 
function parameters from the angular clustering are 



ro/fa^Mpc)^ 



4.0 ±0.6, 1.57 ±0.14 LBG 
4.2 ±0.5, 1.59 ±0.08 BX 
4.5 ±0.6, 1.55 ±0.10 BM 



(41) 

where BM, BX, and LBG are the names we have given 
the U n GlZ color-selection criteria that we used to find 
galaxies at z ~ 1.7, 2.2, and 2.9 (§ED- The quoted la 
errors include random uncertainties, uncertainties in the 
integral constraint corrections, and uncertainties in the 
shapes of the selection functions. 

Since the value of tq depends on the apparent- 
magnitude limit of the samples, at least in the two higher- 
redshift bins (figure EJ), the reported values of r are 
somewhat arbitrary. We chose to limit each sample to 
a fixed range of apparent magnitudes, 23.5 < TZ < 25.5, 
on the grounds that this resulted in a similar comoving 
density in each redshift bin. Different magnitude lim- 
its would have resulted in different correlation lengths. 
Readers should be aware that the numbers we give are 
appropriate to the samples as we have defined them, not 
to the general galaxy population at high redshifts. 

The second part of the paper was based on the proposi- 
tion that WMAP (Spergel et al. 2003) and other experi- 
ments have given us reliable measurements of the cosmo- 
logical parameters and of the shape of the dark matter 
power-spectrum. This implies that we know what sorts 
of virialized dark-matter halos existed at different epochs 
in the past and how their spatial distribution evolved 
over time. Since galaxies reside within dark-matter ha- 
los, they will have the same correlation function as the 
halos on large scales feauation l57|) . The galaxies' cluster- 
ing should therefore tell us what sort of halo they reside 
within. We found a good match (figures |H1 and EJ) be- 
tween the correlation functions of the galaxies and of ha- 
los with threshold masses ranging from 10 115 M Q (LBG) 
to 10 121 M o (BM). 6 EquationOHlgives rough la limits on 
the halos' total masses. Similar masses for Lyman-break 
galaxies have been derived with the same approach by 
Jing & Suto (1998), Adelberger et al. (1998), Giavalisco 
& Dickinson (2000), Porciani & Giavalisco (2002), and 
others. Although the estimated masses were derived 
solely from the galaxy clustering, they seem reasonable 
on other grounds. They cannot be much higher. The 
number density of halos would be lower than the num- 
ber density of LBGs, for example, if the halo mass were 
greater than 10 118 M Q . Such large halo masses would be 
possible only if significantly more than one LBG resided 
in the typical halo, and that is something that our ob- 
servations rule out (§ 15.1(1 . Nor can they be much lower. 
The halos that contain LBGs would not have enough 

6 When we say (for example) that the galaxies have the same 
correlation function as halos with threshold mass M = 10 11 ' 5 M@, 
we mean the subset of halos with mass M > 10 11,5 Mq. The 
median mass of this subset is M = 1O 11 ' 86 M in the GIF-LCDM 
simulation at z = 2.97. Halo subsets can be defined with schemes 
more elaborate than our simple mass threshold (e.g., Kauffmann et 
al. 1999, Bullock, Wechsler, & Somerville 2002), but the differences 
between the possible schemes are too small to affect our analysis 
when the subsets they produce are constrained to have the same 
clustering on large scales. 
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baryons to form the median LBG stellar mass of roughly 
10 10 M Q (Shapley et al. 2001 7 ) unless their total mass 
were greater than about 10 11 M Q . 

We should mention in passing that our best-fit halo 
masses seem to imply that only a small fraction of the 
baryons in the halos are associated with the observed 
galaxies. For example, the best-fit mass threshold of 
10 for LBGs corresponds to a median total mass 

of 10 u - 8 °Mq and median baryonic mass of 1.2 x 10 11 M Q 
(for fl b /n M 0.17, Spergel et al. 2003), roughly ten 
times larger than the observed stellar masses of LBGs. 
Since the 10 s supernovae that explode during the assem- 
bly of the typical LBG's stellar mass will eject roughly 
10 8 M Q of metals (e.g., Woosley & Weaver 1995), enough 
to enrich at most 1.3 x 10 10 M Q of gas to LBGs' typical 
metallicities of 0.4Z Q (Pettini et al. 2002), their observed 
interstellar gas cannot contain a large fraction of the re- 
maining baryons. These baryons need not be associated 
with other objects in the halo, however. They may be 
locked in dim stars that formed in previous episodes of 
star- formation (e.g., Papovich, Dickinson, & Ferguson 
2001), or may have been heated by various processes to 
undetectably high temperatures. The latter is presum- 
ably the case for nearby galaxies, whose ratios of mass 
in stars and gas to total mass are usually also smaller 
than the WMAP value n b /Q M ^ 0.17. The Milky Way, 
for example, has a total mass of 10 12 M© (Zaritsky 1999; 
Wilkinson & Evans 1999) and a mass in gas and stars of 
only - 8 x 10 10 A/ Q (K. Freeman 2004, private commu- 
nication), yet few would assert that its missing baryons 
belong to another galaxy in its halo. 

After establishing plausible total masses for the ha- 
los associated with the galaxies, we considered some of 
the implications. Our arguments were not new (see, e.g., 
Moustakas & Somerville 2002, Martini & Weinberg 2001, 
Adelberger et al. 1998). They seemed worth revisiting 
only because our knowledge of the cosmogony, of the lo- 
cal universe, and of high-redshift galaxies has improved 
so much in the last few years. We began by estimat- 
ing the completeness of our surveys from a comparison 
of the galaxies' number densities to the number densi- 
ties of halos with similar clustering strength (figure 1101) . 
Similar number densities would imply that almost all of 
the most massive halos contained a galaxy that satisfied 
our selection criteria; a much lower galaxy number den- 
sity would imply that most of the galaxies in massive 
halos are missed by our survey. Defining rj as the ra- 
tio of galaxy to halo number density, we found rough 
la limits of 0.2 < ?7lbg < 1, 0.6 < ?7bx < 3, and 
0.5 < ?7bm < 2.5. These limits were derived from the 
clustering at radii r ^ lh^ 1 comoving Mpc. The cluster- 
ing on smaller scales, sensitive to the possible presence 
of more than one galaxy in a halo, implies that the up- 
per limits on ?7bx and tjbm should be revised downwards 
to ~ 1.25. The data appear consistent with the claim of 
Franx et al. (2003) that our selection criteria find roughly 
half of the most massive galaxies at z ~ 2. A complete- 

7 This mass is for a Baldry-Glazebrook (2003; eq. 3) IMF, and is 
therefore 1.82 times lower than the value Shapley et al. calculated 
for an IMF with a Salpeter slope between 0.1 and IOOMq. Their 
assumed Salpeter IMF is probably unrealistic since the IMF in the 
solar neighborhood is known to flatten near ~ IMq and eventually 
turn over at lower masses. See Leitherer (1998) or Renzini (2004) 
for further discussion. 



ness of order 50% seems plausible to us for other reasons 
as well. Shapley et al. (2001) estimate a lifetime for the 
typical LBG of ~ 3 x 10 s yr, for example, which implies 
that the typical LBG will be bright enough for us to de- 
tect for only about half of the time that elapsed between 
the survey selection limits of z ~ 3.4 and z ~ 2.6. 

We considered next the way the clustering of the galax- 
ies would evolve (figure flj). Analysis of the GIF-LCDM 
simulation suggested that the correlation length of LBG 
descendants would be similar by z ~ 2.2 to the correla- 
tion length of galaxies in the BX sample and by z ~ 1.7 
to the correlation length of galaxies in the BM sample. 
The spatial clustering is therefore consistent with the 
idea that we are seeing the same population at all three 
redshifts, though the selection criteria's ~ 50% incom- 
pleteness leaves room for the populations to be distinct 
and the difference in stellar masses between the LBG 
(1O 1O M ) and BX (2 x 10 10 M Q ; Steidel et al. 2005, in 
preparation) samples may not be consistent with con- 
tinuous star-formation at observed rates through the 
elapsed time. Turning our attention to lower redshifts, 
we found that at z ~ 1 our descendants' clustering would 
most closely match the observed clustering of galaxies 
that are red and bright and have early-type spectra (fig- 
ure By z ~ 0.2 the estimated clustering of the de- 
scendants suggested elliptical galaxies as the most likely 
counterparts (figure IT3"| . The correspondence is espe- 
cially hard to dispute for the descendants of the brightest 
and most strongly clustered galaxies in the high-redshift 
samples. 

One conclusion seems difficult to escape: the descen- 
dants of the galaxies in our samples must have signifi- 
cantly larger stellar masses than their high-redshift fore- 
bears. Only ~ 25% of the total stellar mass in the local 
universe is found in galaxies with stellar masses smaller 
than 2 x 1O 1O M (Kauffmann et al. 2003), similar to 
the values in our high-redshift samples, and these faint 
galaxies are too weakly clustered to have descended from 
the galaxies we find at 1.4 < z < 3.5. Only elliptical 
galaxies have a spatial distribution consistent with our 
expectations for the descendants, and the characteristic 
stellar mass of ellipticals is 10 u Mq (Padmanabhan et al. 
2004). The increase in stellar mass from z ~ 2 to z ~ 
could have been produced by ongoing star formation or 
by mergers. Our results at redshift z ~ 1 may favor the 
latter, but in any case our findings are strongly inconsis- 
tent with traditional notions of monolithic collapse. 

It would be unfair to close without mentioning one 
population of high-redshift galaxies that we have ignored 
completely. These are the bright (K ^ 20) near-infrared- 
selected galaxies. Their reported star-formation rates 
(~ 200M Q /yr), correlation lengths (> 9ft," 1 Mpc, Daddi 
et al. 2004) and stellar masses (2 x IO^Mq; van Dokkum 
et al. 2004) are extraordinary, far larger than the cor- 
responding values for typical galaxies in our samples. 8 
Galaxies with similarly extreme properties are not a neg- 
ligible component of the high redshift universe. The 
shape of the 850/mi background implies that up to a 
third (Cowie, Barger, & Kneib 2002) of all stars could 
have formed in objects with star-formation rates greater 

8 These values are not larger, however, than those for galaxies 
in our samples with similar K magnitudes. See, e.g., Shapley et 
al. 2004 and Adelberger et al. 2005 
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than ~ 200 Af Q ; halos at z ~ 3 with the large masses 
M £ 1O 12 7 M implied by r ~ 8-9/j- 1 Mpc contain in 
total almost 20% as many baryons as the more numer- 
ous and smaller halos with M ~ 10 115 M Q that contain 
LBGs; objects with stellar masses M* > 2 x 10 11 M Q con- 
tain nearly 5% of all stars in the local universe (Kauff- 
mann et al. 2003) and 20% of the stars in local ellipti- 
cal galaxies (Padmanabhan et al. 2004). No treatment 
will be entirely complete if it neglects galaxies similar to 
those found in near-IR surveys. The galaxies we stud- 
ied are neither the most massive, nor the most rapidly 
star-forming, nor the most clustered galaxies in the high 
rcdshift universe, but it is precisely this that makes them 
plausible progenitors for the early-type galaxies that sur- 



round us today. 
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